# Common Voice adaptation

Whisper Medium Catalan
Apache-2.0
This is a speech recognition model fine-tuned on the Catalan Common Voice 11.0 dataset based on OpenAI Whisper Medium.
Speech Recognition Transformers Other
W
shields
19
2
Wav2vec2 Large Xlsr 53 Tr Fine Tuning Deprecated
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice Turkish dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Transformers
W
bekirbakar
17
0
Wav2vec2 Large Xlsr Kyrgyz
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Kyrgyz Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition Other
W
iarfmoose
22
2
Wav2vec2 Large Xlsr Turkish
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Turkish Common Voice dataset based on the facebook/wav2vec2-large-xlsr-53 model, achieving a test WER of 21.13%.
Speech Recognition Other
W
cahya
61
2
Wav2vec2 Large Xlsr 53 W2V2 TATAR SMALL
Apache-2.0
This model is a Tatar automatic speech recognition model fine-tuned on the Common Voice 8 dataset based on facebook/wav2vec2-large-xlsr-53, with a test set WER of 53.16%.
Speech Recognition Transformers Other
W
emre
30
1
Wav2vec2 Large Xlsr 53 Polish
Apache-2.0
XLSR-53 large model speech recognition system optimized for Polish, fine-tuned based on facebook/wav2vec2-large-xlsr-53, supports Polish automatic speech recognition
Speech Recognition Other
W
jonatasgrosman
412.13k
11
Wav2vec2 Large Xlsr Sorbian
Apache-2.0
A speech recognition model fine-tuned on Common Voice Upper Sorbian data based on facebook/wav2vec2-large-xlsr-53, supporting automatic speech recognition tasks for Upper Sorbian.
Speech Recognition Other
W
iarfmoose
51
0
Wav2vec2 Large Xlsr Estonian
Apache-2.0
This is an Estonian automatic speech recognition (ASR) model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.
Speech Recognition Other
W
m3hrdadfi
26
0
Wav2vec2 Large Xlsr Breton
Apache-2.0
A speech recognition model fine-tuned on the Breton Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
cahya
25
1
Xlsr En Punctuation
Apache-2.0
Fine-tuned automatic speech recognition model based on facebook/wav2vec2-large-xlsr-53 on the English Common Voice dataset, supporting punctuation prediction
Speech Recognition English
X
boris
30.28k
3
Wav2vec2 Large Xlsr 53 Turkish
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Turkish Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53 model.
Speech Recognition Other
W
dundar
23
1
Wav2vec2 Large Xlsr Kyrgyz
Apache-2.0
A Kyrgyz speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on Common Voice dataset with a word error rate of 34.08%.
Speech Recognition Other
W
aismlv
571
2
Wav2vec2 Large Xlsr Persian
Apache-2.0
A fine-tuned automatic speech recognition model for Persian (Farsi) based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.
Speech Recognition Other
W
m3hrdadfi
562
16
Wav2vec2 Large Xlsr Or
Apache-2.0
Automatic speech recognition model fine-tuned on Odia language based on Facebook's wav2vec2-large-xlsr-53 model
Speech Recognition Other
W
danurahul
22
0
Wav2vec2 Xls R 300m Br Small
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the Common Voice dataset, supporting Breton (br) speech recognition tasks.
Speech Recognition Transformers Other
W
emre
17
0
Wav2vec2 Large Xlsr 53 Russian
Apache-2.0
A Russian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input
Speech Recognition Other
W
jonatasgrosman
3.9M
54
Wav2vec2 Large Xlsr 53 Spanish
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Spanish Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition Spanish
W
mrm8488
38
2
Wav2vec2 Large Xlsr 53 Lithuanian
Apache-2.0
An automatic speech recognition model fine-tuned for Lithuanian using the Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition Other
W
anton-l
29
0
Wav2vec2 Large Xlsr Welsh
Apache-2.0
An automatic speech recognition model fine-tuned on the Welsh Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, achieving a test WER of 29.4%.
Speech Recognition Other
W
Srulikbdd
386
0
Wav2vec2 Large Xlsr Pa IN
Apache-2.0
A speech recognition model fine-tuned on the Punjabi Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
W
danurahul
26
2
Wav2vec2 Xls R 300m As CV8 V1
Apache-2.0
Assamese (Assamese) speech recognition model fine-tuned on the Common Voice 8.0 dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
emre
21
0
Wav2vec2 Large Xlsr Greek 2
Apache-2.0
A speech recognition model fine-tuned on the Greek Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, balancing the training set with synthesized female voice data
Speech Recognition Transformers Other
W
skylord
15
0
Wav2vec2 Xls R 300m Lg
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the COMMON_VOICE - LG dataset, supporting automatic speech recognition tasks for Luganda (lg).
Speech Recognition Transformers Other
W
samitizerxu
22
0
Wav2vec2 Xls R 300m Cs Cv8
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 8.0 Czech dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
comodoro
13
1
Wav2vec2 Large Xlsr Italian
Apache-2.0
An Italian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 13.91% on the Common Voice Italian test set
Speech Recognition Other
W
joaoalvarenga
27
2
Wav2vec2 Large Xlsr Mongolian
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Mongolian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
bayartsogt
16
1
Wav2vec2 Xls R 1b Ro
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the Romanian Common Voice 7.0 dataset based on facebook/wav2vec2-xls-r-1b.
Speech Recognition Transformers Other
W
ubamba98
16
0
Wav2vec2 Large Xlsr Tamil
Apache-2.0
An automatic speech recognition model fine-tuned on the Tamil language using the Common Voice dataset, based on facebook/wav2vec2-large-xlsr-53.
Speech Recognition Other
W
manandey
50
0
Wav2vec2 Large Xlsr 53 Portuguese
Apache-2.0
This is a fine-tuned XLSR-53 large model for Portuguese speech recognition tasks, trained on the Common Voice 6.1 dataset, supporting Portuguese speech-to-text conversion.
Speech Recognition Other
W
jonatasgrosman
4.9M
32
Wav2vec2 Xlsr Chuvash
Apache-2.0
A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Chuvash automatic speech recognition tasks.
Speech Recognition Other
W
gagan3012
54
0
Wav2vec2 Xls R 300m Pa IN R5
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Punjabi (India) dataset based on the facebook/wav2vec2-xls-r-300m model.
Speech Recognition Transformers
W
DrishtiSharma
25
0
Wav2vec2 Xls R 300m Romanian
Apache-2.0
A Romanian speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, achieving a WER of 12.46% on the Common Voice Romanian test set
Speech Recognition Transformers
W
Dumiiii
24
0
Wav2vec2 Large Xlsr 53 Tamil
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Tamil Common Voice dataset based on facebook/wav2vec2-large-xlsr-53.
Speech Recognition Other
W
Amrrs
32.87k
6
Wav2vec2 Large Xlsr 53 Georgian
Apache-2.0
This is a Georgian automatic speech recognition (ASR) model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.
Speech Recognition Other
W
MehdiHosseiniMoghadam
44
1
Wav2vec2 Large Xlsr 53 Breton
Apache-2.0
This is a Breton automatic speech recognition model based on the XLSR Wav2Vec2 architecture, fine-tuned on the Common Voice dataset.
Speech Recognition Other
W
Marxav
27
1
Xlsr 300m CV 8.0 50 EP New Params Nl
Apache-2.0
This is an automatic speech recognition (ASR) model based on the XLS-R architecture with 300M parameters, specifically optimized for Dutch and trained on the Common Voice 8.0 dataset.
Speech Recognition Transformers Other
X
Iskaj
25
0
Xlsr300m Cv 7.0 Nl Lm
Apache-2.0
XLS-R-300M is an automatic speech recognition (ASR) model specifically optimized for Dutch, trained on the Common Voice 8 Dutch dataset.
Speech Recognition Transformers Other
X
Iskaj
23
0
Xls R Ab Test
This is an automatic speech recognition model fine-tuned on the COMMON_VOICE - AB dataset, based on the XLS-R Dummy architecture
Speech Recognition Transformers Other
X
FitoDS
22
0
Wav2vec2 Xls R 300m Rm Sursilv D11
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the Romansh-Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-300m, achieving a 24.09% Word Error Rate (WER) on the Common Voice 8 test set.
Speech Recognition Transformers
W
DrishtiSharma
20
0
Wav2vec2 Xls R 300m Kk N2
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on Kazakh (KK) speech datasets based on the facebook/wav2vec2-xls-r-300m model.
Speech Recognition Transformers Other
W
DrishtiSharma
15
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase